NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimizing CUDA Graph Scheduling with Reinforcement Learning - A Case Study in SSTA Propagation

https://doi.org/10.1109/MLCAD65511.2025.11189194

Chiu, Cheng-Hsiang; Morchdi, Chedi; Chang, Chih-Chun; Yu, Cunxi; Zhou, Yi; Huang, Tsung-Wei (September 2025, IEEE)

Free, publicly-accessible full text available September 8, 2026
Boosting the Performance of Reinforcement Learning-based Task Scheduling using Offline Inference

Morchdi, Chedi etal (May 2024, IEEE HPEC)

Full Text Available
Reinforcement Learning-Generated Topological Order for Dynamic Task Graph Scheduling

https://doi.org/10.1109/HPEC62836.2024.10938506

Chiu, Cheng-Hsiang; Morchdi, Chedi; Zhou, Yi; Zhang, Boyang; Chang, Che; Huang, Tsung-Wei (September 2024, IEEE)

Full Text Available
A Resource-efficient Task Scheduling System using Reinforcement Learning : Invited Paper

https://doi.org/10.1109/ASP-DAC58780.2024.10473960

Morchdi, Chedi; Chiu, Cheng-Hsiang; Zhou, Yi; Huang, Tsung-Wei (January 2024, IEEE)

Full Text Available
Exploring Gradient Oscillation in Deep Neural Network Training

Morchdi, Chedi; Zhou, Yi; Ding, Jie; Wang, Bei (September 2023, 59th Annual Allerton Conference on Communication, Control, and Computing (ALLERTON))

Full Text Available
Exploring Gradient Oscillation in Deep Neural Network Training

Morchdi, Chedi; Zhou, Yi; Ding, Jie; Wang, Bei (September 2023, IEEE)

Understanding optimization in deep learning is a fundamental problem, and recent findings have challenged the previously held belief that gradient descent stably trains deep networks. In this study, we delve deeper into the instability of gradient descent during the training of deep networks. By employing gradient descent to train various modern deep networks, we provide empirical evidence demonstrating that a significant portion of the optimization progress occurs through the utilization of oscillating gradients. These gradients exhibit a high negative correlation between adjacent iterations. Further- more, we make the following noteworthy observations about these gradient oscillations (GO): (i) GO manifests in different training stages for networks with diverse architectures; (ii) when using a large learning rate, GO consistently emerges across all layers of the networks; and (iii) when employing a small learning rate, GO is more prominent in the input layers compared to the output layers. These discoveries indicate that GO is an inherent characteristic of training different types of neural networks and may serve as a source of inspiration for the development of novel optimizer designs.
more » « less
Full Text Available

Search for: All records